Restoring an Elided Entry Word in a Sentence for Encyclopedia QA System

Authors

  • Soojong Lim
  • Changki Lee
  • Myoung-Gil Jang
Abstract

This paper presents a hybrid model for restoring an elided entry word for an encyclopedia QA system. In Korean encyclopedias, the entry word is frequently omitted from a sentence. If the QA system uses a sentence without its entry word, it cannot provide the right answer. To resolve this problem, we combine a rule-based approach with a Maximum Entropy model to exploit the merits of each. The rule-based approach uses caseframes and sense classes. The results show that the combined approach gives a 20% increase over our baseline.
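
The abstract does not say how the rule-based component and the Maximum Entropy model are combined. As a minimal sketch, assuming a rule-first arrangement with a statistical fallback, the idea could look like the following. The caseframe table, feature names, weights, and function names are all hypothetical and chosen only for illustration; a binary Maximum Entropy model is equivalent to logistic regression, which is what the fallback computes.

```python
import math

# Hypothetical caseframes: verb -> sense classes allowed in its subject slot.
CASEFRAMES = {"born": {"PERSON"}, "founded": {"PERSON", "ORGANIZATION"}}

# Hypothetical MaxEnt weights for binary features (illustrative, not learned).
WEIGHTS = {"no_subject": 1.5, "first_sentence": 0.8, "has_pronoun": -2.0}
BIAS = -1.0


def rule_decision(verb, has_subject, entry_sense):
    """Return True/False when a rule fires, or None when the rules abstain."""
    if has_subject:
        return False  # subject slot is already filled, nothing to restore
    allowed = CASEFRAMES.get(verb)
    if allowed is not None and entry_sense in allowed:
        return True  # entry word's sense class fits the empty subject slot
    return None  # no applicable caseframe: defer to the statistical model


def maxent_probability(features):
    """Binary MaxEnt (logistic) probability that the entry word was elided."""
    score = BIAS + sum(WEIGHTS.get(f, 0.0) for f in features)
    return 1.0 / (1.0 + math.exp(-score))


def should_restore(verb, has_subject, entry_sense, features):
    """Rule first; fall back to the MaxEnt score when the rules abstain."""
    decision = rule_decision(verb, has_subject, entry_sense)
    if decision is not None:
        return decision
    return maxent_probability(features) >= 0.5


def restore(entry_word, sentence, **kwargs):
    """Prepend the entry word when the hybrid decision says it was elided."""
    return f"{entry_word} {sentence}" if should_restore(**kwargs) else sentence
```

In this sketch the rules take precedence because they are assumed to be precise but narrow, while the statistical model covers the sentences the rules cannot decide. Other combinations, such as feeding rule outputs into the model as features, are equally plausible readings of the abstract.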

Similar resources

A Hybrid Machine Translation System Based on a Monotone Decoder

In this paper, a hybrid Machine Translation (MT) system is proposed by combining the result of a rule-based machine translation (RBMT) system with a statistical approach. The RBMT uses a set of linguistic rules for translation, which leads to better translation results in terms of word ordering and syntactic structure. On the other hand, SMT works better in lexical choice. Therefore, in our sys...

Design and Implementation of an Intelligent Part of Speech Generator

The aim of this paper is to report on an attempt to design and implement an intelligent system capable of generating the correct part of speech for a given sentence while the sentence is totally new to the system and not stored in any database available to the system. It follows the same steps a normal individual does to provide the correct parts of speech using a natural language processor. It...

Effective Term Weighting for Sentence Retrieval

A well-known challenge of information retrieval is how to infer a user’s underlying information need when the input query consists of only a few keywords. Question Answering (QA) systems face an equally important but opposite challenge: given a verbose question, how can the system infer the relative importance of terms in order to differentiate the core information need from supporting context?...

A Comparative Study of the Stages of Encyclopedia Compilation in the Encyclopedia of the World of Islam and the Encyclopedia of Islam (Leiden Edition)

Purpose: The present paper compares two reference books, the Encyclopedia of the World of Islam and the Encyclopedia of Islam (Leiden), with regard to the whole process of compiling an encyclopedia, and conducts an evaluative content analysis. Methodology: This is a comparative survey and a content analysis study. The data collection tool in the comparative survey section is a questionnaire, and in...

Children Want to Access Every Interpretation Adults Do: Children’s Knowledge of Ambiguity in ACD Constructions

Syntax presents difficulties for the child acquiring language because it requires the induction of an underlying grammatical system from a corpus of sentences. Certain aspects of this acquisition are especially challenging, because they require the learner to induce representations that have no correlate in the surface structure. Antecedent-contained deletion has such properties: it involves two...

Publication date: 2005